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Abstract 


This paper presents a consensus-based formation control scheme for autonomous 
multi-agent systems represented by double integrator dynamics. Assuming that the 
information graph topology consists of an undirected connected graph, a leader-based 
consensus-type control law is presented and shown to provide asymptotic formation 
stability when subjected to piecewise constant formation velocity commands. It is 
also shown that global asymptotic stability is preserved in the presence of (0, oo)- 
sector monotonic non-decreasing actuator nonlinearities. 


1 Introduction 

Cooperative control of autonomous multi-agent systems has been a subject of con- 
siderable research in recent years. Although a large volume of literature exists on 
this subject, only a few representative references are included in the list of references 
in this paper in the interest of brevity. In particular, a comprehensive literature re- 
view may be found in [1], while [2] and [3] contain an excellent introduction. In the 
context of aviation, large numbers of autonomous vehicles are expected to operate 
in the national air space in the future. Autonomous formation flight of multiple air 
vehicles has the potential for significantly increasing airspace utilization as well as 
fuel efficiency. In other application areas, autonomous multi-agent control strate- 
gies are being investigated for optimally performing common tasks. In an abstract 
sense, a “formation” may be generically considered to be an arrangement of net- 
worked agents. An optimal formation can be designed to maximize some measure of 
performance (such as system throughput in airspace or communication networks). 

This paper investigates a consensus-based formation control scheme for multi- 
agent systems represented by double integrator agent dynamics, wherein each agent 
has the ability of bidirectional information exchange with a group of other agents 
(referred to as its ‘neighbors’), and at least one information path exists between 
every pair of agents, i.e., the information topology is represented by a connected 
undirected graph. (The reader is referred to [2] for detailed definitions) In Section 
2, a double integrator multi-agent system is defined and a consensus-type formation 
control law is presented for a leader-follower architecture wherein the leader is re- 
quired to follow a desired piecewise constant velocity command, and the followers 
are required to match the commanded velocity while maintaining the formation. To 
facilitate the analysis, 1-dimensional systems are considered initially. The results are 
readily extendable to higher dimensional motion (e.g., planar or 3-dimensional) using 
tools such as Kronecker product algebra, as presented in a latter section. Assuming 
that each follower agent has access only to the relative positions and velocities of 
its neighbors and the knowledge of the desired formation geometry, it is shown via 
Lyapunov approach that the formation is asymptotically stable, i.e., starting from 
any initial positions and velocities, the agents will asymptotically attain the desired 
formation velocity and the formation shape. In Section 3, actuator nonlinearities are 
considered and it is proved that the global asymptotic stability of the formation is 
preserved in the presence of (0, oo)-sector monotonic non-decreasing actuator non- 
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linearities. Extension to the multi-dimensional motion case is addressed in Section 
4. An example consisting of a 5-agent system is presented in Section 5, and Section 
6 contains concluding remarks. 

2 Consensus-Based Leader-Follower Type Control 

In a system consisting of N agents, suppose the dynamics of the agents are described 
by 

ii = ui, i = 1,2,..,, AT, fT (1) 

where denotes the position of the ith agent and u t is the input. (In multi- agent 
systems of this form, in general, the “position” is defined in an abstract sense (e.g., 
information state). For vehicles represented by point masses, the position is usually 
planar (n = 2) or 3-dimensional (n = 3)). Suppose Agent 1 is designated as the 
leader, and agents 2, . . . , N are the followers. In undirected graph topology, agents 
i and j are defined to be neighbors of each other if there exists an edge (which rep- 
resents information exchange) between them. The formation geometry is described 
by the relative position set V 

V = { dij := iw - £j d} (2) 

where £*£>, are the desired position states of agents i and j who are neighbors 
of each other. V is assumed to be fixed (time- invariant). The objective is to get 
all the agents to converge to the pre-defined formation shape and the commanded 
velocity starting from any initial condition, and maintain the formation shape in the 
presence of any piecewise constant velocity commands. 

The basic consensus protocol can be modeled using the degree matrix D and 
the adjacency matrix A. The degree matrix is an N x N diagonal matrix with 
Da = degree of node i, i.e, the number of neighbors of the ith agent. The N x N 
adjacency matrix denotes the connectivity between nodes (in the sense of information 
exchange), with A tJ = 1 if j is a neighbor of i; otherwise, A,j = 0 (for undirected 
graphs). For example, for the 5-agent system shown in Figure 1, 

'200001 [ 0 1 1 0 0 " 

0 3 0 0 0 1 0 1 1 0 

D = 00200 , A = 11000 (3) 

00020 01001 

OOOOlJ [o 0 0 1 0. 

The Laplacian matrix L , which plays a central role in the dynamics of the consensus 
protocol, is defined as 

L = D — A (4) 

For the example system in Figure 1, 

'2-1-1 0 O' 

-1 3-1-1 0 

L= - 1 -1 2 0 0 (5) 

0-1 0 2-1 

0 0 0 -1 1 _ 
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Figure 1. Multi-agent system 


For undirected connected graphs, the Laplacian matrix L is symmetric positive 
semidehnite and has a single zero eigenvalue [2]. Also all the row sums are zero, i.e. , 
1 = [1, 1, . . . , l\ r is an eigenvector corresponding to the zero eigenvalue. 

The aggregated (summed) relative position and velocity for agent i , y p i and y r i, 
are given by 

Vpi = 'y ] (£i — £}) 5 Uri = y ] (£i — £,j) (6) 

j6 ATi je K 

where A/) denotes the set of neighbors of agent i. For the example system in Figure 
1,M = {2,3}, = {1,3,4}, etc. 

Suppose the leader is given a constant velocity command vd, such that the 
leader’s desired position satisfies 

ilD = VD- (7) 

The objective of the system is to form and maintain a fixed formation V starting 
from any initial states of the agents for any given constant velocity command. (Note 
that the followers’ desired velocity is also the same, i.e., £ t n = vd Vi G [1, . . . , N], 
Thus, vd is the desired formation velocity, which is assumed to be piecewise constant 
(in time). 

Consider the consensus control law for the follower agent i (i = 2, . . . , N): 

V-i = — y ^ kp(^i — ^ j ) + ^Y (ii t~j) kp(^iD — £jD) k r (iiD — £j'd) (8) 

jeATi 

— [kpDpi T k r y r i] + k p y ^ dij, for i — 2, . . . , N (9) 

where k p k r are positive scalars. (Note that the last term in Eq. (8) vanishes because 

Ld = vd Vi ). 
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(10) 


The leader’s control law is given by 

U\ — Utr ^ ( &p(£l Cj) T 1 £j) kpd\j 

j£A/i 

where Uf r is the tracking component of the leader’s control law which is given by 

Utr — Cl d) T ^r(Cl Ld)] (11) 

where k p . n r are positive constants. Suppose the tracking control law gains in Eq. 
( 11 ) are given by 

k p = yfcp; K r = 7 k r (12) 

where 7 is a positive scalar. Define the matrix C as 

Ai = £11+7 

Cij = Lij for all other i,j. (13) 

where L denotes the Laplacian matrix. It is shown next that C is positive definite. 
Proposition 1 . The matrix C is positive definite. 

Proof- Since L is positive semidehnite, C is at least positive semidehnite. Also, 
since L has a single zero eigenvalue with a corresponding eigenvector <51 (where 5 
is any non-zero scalar), z T Lz > 0 Vz G $t N , and z T Lz = 0 only for z = Si. Since 
(< 5 l) T £(£l) = 7 S 2 > 0, C is positive definite. □ 

2.1 Formation Stability 

Define 

x = a-t D ( 14 ) 

Note that C_d is time- varying in view of (7). The agent dynamics can be written as: 

X = u (15) 

The control input from Eqs. (10) and (9) can be written as 

u = —kpCX — k r CX (16) 

and the closed-loop dynamics are given by 

X = —kpCX - k r CX (17) 

The asymptotic stability of the closed-loop system in (17) can be readily established 
by considering the Lyapunov function 

V = X T X + k p X T LX (18) 

which, after differentiation with respect to t and simplification, yields 

V = -2 k r X T CX (19) 

Thus V is negative semidehnite. Furthermore, V = 0 implies X = 0, which in 
turn implies (from (17)) that X = 0. Therefore, by LaSalle’s invariance principle, 
the system in (17) is asymptotically stable, i.e., the consensus-based leader-follower 
control law in (9), (10) provides asymptotic formation convergence (Vi, j. [ £i(t ) — 
£j(t)] — > dij, and — > vd, as t — > 00 ) for any initial conditions for any constant 

velocity command Vfj. 
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2 . 1.1 


Remarks 


1. The agents’ dynamics consist of double integrators, which represents, for ex- 
ample, mass-normalized dynamics of point masses. 

2. The control laws for each follower agent only needs the knowledge of the relative 
positions and velocities of its neighbors (with respect to itself), and the knowl- 
edge of the formation geometry {dij} of its neighbors. The leader’s control law 
additionally needs the desired leader position and velocity. 

3. It is assumed in this paper that the position and rate gains are the same for 
all the agents. It is possible to use different gains for each agent, but would 
require some additional conditions for stability. 


3 Stability in the Presence of Actuator Nonlinearities 

In practice, it is important to consider the fact that the actuators have limited 
control authority. In [4] , a control law that uses an inherently bounded control input 
was presented for the double integrator consensus problem. A different approach is 
used in this paper to show that the control law given in Section 2 preserves global 
asymptotic stability in spite of saturation type (and more general types of) actuator 
nonlinearities. 

Suppose each agent’s actuator has a continuous, monotonic non-decreasing, (0, oo)- 
sector nonlinearity, which results in the actual inputs u a f. 

Uai = <M«i), i = l,...,N (20) 

where 

<j<f>i(a) > 0 Vex G K, <7 / 0, and (f)(0) = 0 (21) 

Figure 2 shows an example of a (0, oo)-sector monotonic non-decreasing nonlinearity. 



Figure 2. (0, oo)-sector monotonic non-decreasing nonlinearity 


Then the closed-loop system is given by 

X = 4>(—k p Wp — k r w r ) 


( 22 ) 
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where <f)(.) denotes the vector of nonlinearities (assumed to be continuous single- 
valued functions), and 

w p = CX , w r = CX (23) 

The following theorem shows that the closed-loop system is globally asymptoti- 
cally stable in the presence of (0, oo)-sector monotonic non-decreasing actuator non- 
linearities. 

Theorem 1- The closed-loop system (22) is globally asymptotically stable if the 
actuator nonlinearities 4>i(.) are monotonic non-decreasing and belong to the (0, oo)- 
sector. 

Proof- Denoting 

= -</>i(-<r), i ■ ■ ■ ,N (24) 

it can be shown that the nonlinear functions ipi(-) are also in the (0, oo)-sector and 
monotonic non-decreasing. The closed-loop system can be expressed as 


X = —if ){k p Wp + k r w r ) 


(25) 


Consider the Lyapunov function 


9 n r k 

v = x T cx + f- V / 

k P U -Jo 


ipi(a)da 


(26) 


V is non-negative MX G 3?^, and V = 0 only when X = 0 and w p = 0, i.e., X = 0 
(from (23)), therefore V is positive definite. It is also radially unbounded. Upon 
differentiating with respect to time t and using (23), 

N 

V = —2X J £if)(k p ujp + k r w r ) + 2 Wpi^i{kpW p i) 

i = 1 

= — 2— (k r w r ) T [ip(kpWp + k r w r ) — ip(kpWp)] (27) 

rx/'p 

Since V’i(-) are monotonic nondecreasing, V < 0. Also, V = 0 would imply that w r i = 
0 Vi, or kpiWpi and ( k p iW p i + k r iVJ r i) are both in a flat non-zero region of the graph of 
each actuator nonlinearity ipi Mt. The former situation (w r = 0) would imply (from 
(23)) that X = X = 0, and therefore (from (25) and (21)) that w p = 0 and X = 0 
(from (23)). From (25), the latter situation can occur only when X (and therefore 
V) grows unbounded, which is not possible because V < 0. Therefore V cannot 
be zero along any non-zero trajectories, and the system is globally asymptotically 
stable. □ 


4 Multi-Dimensional Motion 

The results presented in Sections 2 and 3 for 1-d motion can be readily extended 
to the planar and 3-dimensional cases (or generally for the n-dimensional case for 
abstract networks). For the n-dimensional case, noting that each £ is n-dimensional, 


6 



X and u in Eq. (15) are (JV.n)-dimensional, as are w p and w r . The control input of 
(16) can be written as 

u = — k p [C < 8 > In]u>p ~ k r [C < 8 > Iv\w r (28) 

Alternatively, it may be more intuitive to express the position vector X partitioned 
by the n axes. When n = 3, dehne 

X=[Xx.xJ’,X* ] T ( 29 ) 

where 

Xx = [X\x, X 2x , ■ ■ ■ , X]y x } T (30) 

(similar for y- and z-axes). With w p , w r , u re-ordered similarly as u p , u r , v the 
closed-loop equations for each axis can be written separately as 

Xi — V — k p iUJ p i kriUJri, i — z. (31) 

Then the stability results of Sections 2 and 3 carry over in a straightforward manner. 
(Note that a different k p . k r , and 7 can be chosen for each axis). 

4.1 Effect of Rotation 

In 2-d and 3-d motion, it is often important to change the formation’s orientation to 
match the orientation of the velocity vector vp- Suppose the formation’s orientation 
is initially aligned with the velocity vector vdq, and a new formation velocity vector 
Vfji is desired, which has a different magnitude as well as orientation, expressed by 
Euler rotation angles 4>,9 ,ip about the x,y,z axes, i.e., 

vdi = T^)T 2 {9)Ti{(j))v D Q + Sv (32) 

where T,;s are the Euler rotation matrices and 5v G K 3 . If all the motion is expressed 
in the new coordinate system, the formation geometry V remains unchanged. Assum- 
ing each vehicle has the knowledge of the new coordinate system (i.e., the direction 
of the velocity command) ( 0 , 9 , ip) and that the vehicles can change their individual 
orientation to match the new coordinate system, the vehicles’ relative positions and 
velocities are measured in the new coordinate system and the control laws would 
remain unchanged. 

5 Example 

For the 5-agent system in Figure 1, suppose it is required to form and maintain a 
V-shaped formation with Agent 1 as the leader, as shown in Figure 3. The motion 
is assumed to be planar, and the agents are initially at rest and dispersed at random 
locations. Suppose a velocity command vr> = (1, —2) is given to the leader. Using 
the control laws (9), (10) with arbitrarily chosen gains k px = kpy = 10, k rx = k ry = 
5, and 7 = 1, it can be verified that all closed- loop eigenvalues have negative real 
parts. The resulting trajectories are shown in Figure 4 from t = Os to t = 10s, 
which indicate that the desired formation shape and velocity are mostly attained by 
t = 6.5s, and fully attained and maintained by t = 10s in the presence of constant 
velocity command. 
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Figure 3. Desired formation shape 



Figure 4. Agent trajectories 


6 Concluding Remarks 


A consensus-based formation control scheme was presented for multi-agent systems 
consisting of agents described by double integrator dynamics. The information topol- 
ogy of the system was assumed to consist of a connected undirected graph, and a 
leader-follower architecture was used wherein the leader is required to follow a piece- 
wise constant velocity command. It was shown that the control scheme provides 
asymptotic stability, i.e., the formation shape is asymptotically attained and main- 
tained, and the formation velocity asymptotically approaches the desired velocity. It 
was also shown that global asymptotic stability is preserved in the presence of (0, oo)- 
sector monotonic non-decreasing actuator nonlinearities, which include saturation. 
An important application of multi-agent systems theory is the multi- vehicle coopera- 
tive control problem for autonomous vehicles. Considerable further work needs to be 
done in this area to realistically address this problem and to find practical solutions. 
In the example problem presented, the controller gains were chosen arbitrarily. It 
would be highly desirable to investigate techniques for designing controller gains to 
optimize a given performance index. In addition, optimal design of the communi- 
cation structure (e.g., selecting the most effective edges in the information topology 
graph) should be investigated. Furthermore, the control laws and the stability re- 
sults need to be substantially extended, first to agents represented by inertial masses 
with translation-rotation coupling, and then to general uncertain dynamic systems, 
both linear and nonlinear, such as aircraft. An important topic that was not ad- 
dressed in this paper is collision avoidance strategies, which should be built into the 
control laws. It is also necessary to develop reliable simulation-based methods and 
experimental techniques to demonstrate, validate, and verify the control schemes. 
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